molecular graph
ComENet: Towards Complete and Efficient Message Passing for 3DMolecular Graphs
Many real-world data can be modeled as 3D graphs, but learning representations that incorporates 3D information completely and efficiently is challenging. Existing methods either use partial 3D information, or suffer from excessive computational cost. To incorporate 3D information completely and efficiently, we propose a novel message passing scheme that operates within 1-hop neighborhood.
1160792eab11de2bbaf9e71fce191e8c-Supplemental-Conference.pdf
The vocabulary Vconstructed by Algorithm 1 exhibits the following advantageous properties. Prior to the proof, we first present a clear observation of the created vocabulary V: Proposition A.2. Given any F,F V, for any their instances arising on an arbitrary molecule during the extraction process, either they are not spatially intersected F F =, or they contain each other: F F or F F. Now we prove each claim in the above theorem. We prove it by contradiction. If it is the former case, then Fi1 should be firstly extracted and then merged with other fragments to yield Fi2 which means i1 < i2, conflicting with the assumption.
Equivariant Blurring Diffusion for Hierarchical Molecular Conformer Generation
How can diffusion models process 3D geometries in a coarse-to-fine manner, akin to our multiscale view of the world?In this paper, we address the question by focusing on a fundamental biochemical problem of generating 3D molecular conformers conditioned on molecular graphs in a multiscale manner. Our approach consists of two hierarchical stages: i) generation of coarse-grained fragment-level 3D structure from the molecular graph, and ii) generation of fine atomic details from the coarse-grained approximated structure while allowing the latter to be adjusted simultaneously.For the challenging second stage, which demands preserving coarse-grained information while ensuring SE(3) equivariance, we introduce a novel generative model termed Equivariant Blurring Diffusion (EBD), which defines a forward process that moves towards the fragment-level coarse-grained structure by blurring the fine atomic details of conformers, and a reverse process that performs the opposite operation using equivariant networks.We demonstrate the effectiveness of EBD by geometric and chemical comparison to state-of-the-art denoising diffusion models on a benchmark of drug-like molecules.Ablation studies draw insights on the design of EBD by thoroughly analyzing its architecture, which includes the design of the loss function and the data corruption process.Codes are released at https://github.com/Shen-Lab/EBD.
Empowering Active Learning for 3D Molecular Graphs with Geometric Graph Isomorphism
Molecular learning is pivotal in many real-world applications, such as drug discovery. Supervised learning requires heavy human annotation, which is particularly challenging for molecular data, e.g., the commonly used density functional theory (DFT) is highly computationally expensive. Active learning (AL) automatically queries labels for most informative samples, thereby remarkably alleviating the annotation hurdle. In this paper, we present a principled AL paradigm for molecular learning, where we treat molecules as 3D molecular graphs. Specifically, we propose a new diversity sampling method to eliminate mutual redundancy built on distributions of 3D geometries.
Understanding the Limitations of Deep Models for Molecular property prediction: Insights and Solutions
Molecular Property Prediction (MPP) is a critical task in computational drug discovery, aimed at identifying molecules with desirable pharmacological and ADMET (absorption, distribution, metabolism, excretion, and toxicity) properties. Machine learning models have been widely used in this fast-growing field, with two types of models being commonly employed: traditional non-deep models and deep models.